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LOUDNESS DETERMINED BY POWER SUMMATION 
by Walton L Howes 
Lewis Research Center 

SUMMARY 

The predicted overall loudness of steady, broad-band noise is usually computed by 
summing weighted loudnesses of subbands of the noise intensity (mean-square pressure) 
spectrum. It is proposed, instead, that the overall loudness should be computed by 
summing weighted intensities of subbands ("critical bands") of noise and then obtaining 
the loudness of the sum. The proposed computation method seems to yield better agree- 
ment with published loudness judgment data than does the usual method. It appears that 
the proposed method should also yield better agreement with annoyance judgments than 
does the "perceived noise" method of Kryter. 


INTRODUCTION 

The magnitude of the auditory sensation produced by a sound stimulus defines its 
loudness. If the attention of an individual is consciously directed toward perceiving the 
sound, then the individual is capable of establishing quantitative relations among the 
subjectively judged loudnesses of different sounds. The remaining problem is to relate 
these subjective judgments to the physical attributes of the sounds so that the loudness 
of any sound can be predicted. 


REVIEW 

For pure tone stimuli (specifically 1 kHz) Knauss proposed (ref. 1) on the basis of 

test data at suprathreshold levels that loudness & is proportional to the 1/3 power of 

the acoustic intensity. S. S. Stevens recognized (ref. 2) that the word intensity should 

refer to mean-square pressure, which is really the quantity more closely related to 

sound measurements. Even more specifically, in practice the squared sound pressure 
o 

p is averaged over a finite time which should correspond to the auditory integration 



time («0. 2 s). Hence, test data indicate that for a pure tone 


(» 

where s « 1/3, and the tilde denotes an average over the auditory integration time. 

The proportionality constant k implicity accounts for the physical power transmitted to 
the auditory cortex. This transmittance is a function of frequency and, over some fre- 
quency ranges, is a function of the mean-square sound pressure as well (ref. 3). The 
transmittance is a maximum in the general neighborhood of 1 kilohertz, that is, within 
the frequency interval in which the auditory system is most sensitive. 

The loudness of sounds which include a broad spectral range of frequencies can also 
be evaluated quantitatively by subjective methods. However, the establishment of a 
correct relation between loudness and physical characteristics has proved more elusive 
for broad-band sounds than for pure tones. Although the unweighted, or the A or B 
weighted, sound-pressure level of a broad-band sound is sometimes a good measure of 
its loudness level, this is not generally true. (In acoustics, "level" implies that the 
magnitude is expressed in decibels. ) 

Historically, the first serious attempt to predict the loudness of sounds appears to 
have been by Steinberg (ref. 4). Steinberg's empirical loudness formula - based on the 
erroneous Fechner law, rather than equation (1) - was so general that it incorporated 
implicitly the presently recognized constituents, namely, physical power, transmittance 
function, power summation, and critical bands, required for a satisfactory loudness 
prediction formula. Unfortunately, the experimental data available to Steinberg were 
unsatisfactory. Consequently, when Fletcher and Munson (ref. 5) published much im- 
proved loudness judgment data and their own loudness calculation procedure based on 
loudness summation, Steinberg's formula was thenceforth disregarded. The acceptance 
of loudness summation by subsequent investigators has persisted for nearly 40 years, 
and "loudness summation" has become a common term in psychoacoustics. In the 
present context loudness summation implies that the overall loudness of a broad-band 
noise is determined by summing the loudnesses of its individual components in the form 
of pure tones or subbands of the overall spectrum. Loudness summation has no theoret- 
ical basis and has never been claimed to have one (ref. 6). 


ARGUMENT FOR POWER SUMMATION 

Herein, it will be shown that the overall loudness of broad-band noise is not deter- 
mined by loudness summation, but rather by power (more specifically, average power) 
summation over bandwidths equal to the so-called "critical" frequency bands defined 
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and specified by Zwicker, F lot tor p, and Stevens (refs. 7 and 8). Subjective loudness 
judgment tests (ref. 8) have shown that, surrounding every audible frequency, there 
exists a so-called critical frequency band. All sounds which possess the same sound 
pressure and do not exceed the local critical bandwidth are equally loud. If the band- 
width of a sound is widened beyond the limits of the local critical bandwidth while the 
sound pressure is held constant, then the loudness of the sound increases. In fact, 
these conditions define the critical band. 

As a basis for the claim that power summation should supersede loudness summa- 
tion, note that the phenomena which occur in the auditory system are certainly physical 
phenomena (not psychological phenomena) involving the transmission of information to 
the brain. The transmission of information requires the transmission of physical power. 
The psychoacoustic quantity called loudness is an intensive quantity which for a pure 
tone is proportional to a numerical power of the relative transmitted physical power TT 
averaged over the auditory integration time, that is, 

*= *r[ s (2) 

where k is a proportionality constant. This is so because of equation (1) and the fact 
that, for sound waves, average physical power is proportional to mean-square pressure. 
Hence, it is to be expected that power transmitted to the higher nervous system consti- 
tutes the physical embodiment of loudness. All physical operations within the brain, 
including summations, must involve physical power. Hence, it is expected as an exten- 
sion of equation (1) and the existence of critical bands that power summation (as opposed 
to loudness summation) determines the overall loudness. (The critical band phenomenon 
implies that eq. (1) is valid, not only for pure tones, but also for complex sounds not 
exceeding the critical bandwidth. This corresponds to power summation over the band- 
width. Hence, loudness summation is certainly invalid within critical bands.) 

The failure of loudness summation is best illustrated by Stevens' empirical formula 
(ref. 6) for the overall loudness SC of broad- band noise. This formula was also adopted 
by Kryter (ref. 9) and many others for calculating the annoyance of noise. Stevens' 
formula is 


SC = + F 

m 



N 

y f & 

L~i n n 


n 


(3a) 


(3b) 
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(3c) 



where 1 ? is the loudness of the loudest of N frequency bands of noise enclosing the 
audible noise spectrum, F < 1 is an empirical function of the sound pressure (ref. 10), 
F n = 1 for n = m, and F n = F for n + m. Equation (3a) is the formula as presented 
by Stevens. When rewritten as equation (3b) it is clear that equation (3a) involves a 
weighted loudness summation. The third form, equation (3c), will permit ready com- 
parison with the power summation formula for loudness. Because F < 1, it follows 

N 

that !£ < ^ <£^, that is, the observed overall loudness is less than the sum of the loud- 

n 

nesses in the subbands. Therefore, if equation (3) is correct, then unweighted loudness 
summation fails. Equation (3) appears to possess an important fault. Specifically, the 
function F must be determined from the overall loudness. This means that, in order 
to compute the loudness of a sound, the loudness of that sound must already be known. 
For a given sound the function F can be computed from subjective loudness judgments, 
but it cannot be proved that the same function F will be the correct one for any other 
sound. In fact, examples will be presented in which equation (3) apparently fails. 

If loudness summation is replaced by power summation, the preceding difficulties 
do not arise. The loudness appears to be correctly determined by transmitted power 
summation, and the formulation does not involve functions, such as F, which may differ 
for each noise under study. 


OUTLINE OF POWER SUMMATION METHOD 

Because the mechanical portion (within the ear) of the auditory system is linear 
(ref. 11) and the electrochemical portion (the nervous system) is nonlinear (ref. 12), in 
that the amplitude of the transmitted whole-nerve signal is proportional to a power-law 
function of the sound pressure amplitude, the mathematical theory of loudness reported 
in reference 13 can be derived. It follows from this theory that, for broad-band noise, 
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where | | is the relative average power transmitted to the auditory cortex in each of 

N critical frequency bands throughout the auditory integration time. Note that loudness 
summation implies that the exponent s is applied before summing (eq. (3c)), whereas 
power summation (eq. (4)) implies that s is applied after summing. 

The critical-band phenomenon suggests that the sound spectrum should be resolved 
into bandwidths equal to critical bandwidths. The critical bandwidth is an increasing 
function of the band mid-frequency. For frequencies less than 500 hertz the critical 
bandwidths are approximately 100 hertz, whereas for frequencies greater than 500 hertz 
the critical bands are roughly one -third octave in width (ref. 8). 

The transmitted relative power in any given frequency band k is expressed in 
terms of measurable quantities according to 

TT k = T k>l < 5 > 

where T^ is the average transmittance of the internal auditory system (starting at the 

1L 2 

eardrum) in the k band and p. is the relative, mean- square sound pressure in the 
k band measured at the eardrum over the auditory integration time. Values of the 
entire transmittance (external plus internal) are obtained by inverting and normalizing 
equal loudness curves for pure tone stimuli, as measured, for example, by Fletcher 
and Munson (refs. 5 and 14) for listening with earphones, or Robinson and Dadson 
(ref. 15) for direct listening to plane, progressive waves incident from the front. (One 
should be wary about applying published transmittance curves for listening with ear- 
phones because the external transmittance may be a sensitive function of the earphone 
geometry and frequency response, the geometry of the observer's head, and the tight- 
ness of fit. ) It is convenient to let the T n designate internal transmittance functions 
encompassing the auditory pathway from the eardrum to the auditory cortex because this 
function is independent of the source-observer geometry. For direct listening the 
Robinson- Dadson curves include both internal filtering plus external effects caused by 
diffraction of the sound by the head and transmission of the waves along the external 
auditory meatus to the eardrum. The external and internal effects can be separated by 
considering data obtained by Wiener and Ross (ref. 16) concerning the ratio of sound 
pressure in the free field to that at the eardrum. From these data the mean-square 
pressure at the eardrum can easily be calculated from knowledge of the mean-square 
pressure in the free field. If the sound field is diffuse, rather than made up of plane 
progressive waves, then the external filter function can be modified by using the data of 
Robinson, Whittle, and Bowsher (ref. 17), which relate the pure-tone transmittance 
functions for these two types of noise sources. 


5 



LOUDNESS PREDICTIONS AND JUDGMENTS COMPARED 


The precision of loudness judgments is low in comparison with that of physical 
sound-pressure-level measurements simply because of deficiencies of the auditory sys- 
tem as an intensity measuring device. The standard deviation of repeated loudness 
judgments by one observer is of the order of 3 decibels, whereas the standard deviation 
for several observers, each making one observation, is of the order of 10 decibels. 

A basic experiment in loudness determination essentially consists of subjectively 
equating the loudness of two sounds. Most experiments have been of this type. Equally 
important are experiments which require quantitative judgments of the relative loudness 
of two sounds of unequal loudness. Very few experiments of this type have been per- 
formed. When equal- loudness judgments are used to test any loudness prediction 
scheme, only the precision of the scheme is really tested. Equal- loudness tests, alone, 
provide no information on the accuracy of the method. The accuracy can only be deter- 
mined by comparing predictions with unequal-loudness judgments. Fortunately, from 
the theoretical standpoint, most of the unequal-loudness tests consisted of comparisons 
of pure tones with the sound- pressure levels being noted (refs. 5 and 15). These 
tests have permitted the establishment of absolute relations between loudness and sound 
pressure for easily reproducible sounds, namely, pure tones. Then, the absolute loud- 
nesses of other sounds can be determined by subjectively equating them to pure tones. 
Unfortunately, very few experimenters have chosen to do this. Rather, most experi- 
menters have equated loudnesses of complex sounds, one of which they have called a 
standard, and thereby have been unable to establish the absolute loudness of any of the 
sounds tested. By this procedure only the precision (standard deviation) of predicted 
loudness levels of sounds judged equally loud can be computed, but the computed average 
loudness level has no absolute significance. Hence, the accuracy of the predictions 
cannot be determined. 

To demonstrate the validity of power summation, loudness levels were computed 
for sounds judged equally loud, with one sound a pure tone. Data of this type have been 
reported by Corliss and Winzer (ref. 18) and by Pearsons, Horonjeff, and Bishop 
(ref. 19). Although in both studies the loudness levels were not varied much, neverthe- 
less the accuracy of the power summation method can be demonstrated over a limited 
level range. (Assume that the test data are valid. Then, if Stevens' weighted- loudness - 
summation formula (eq. (3)) leads to accurate loudness predictions, this implies failure 
of unweighted loudness summation. If Stevens' formula fails, this logically indicates 
the failure of both the formula and the unweighted- loudness summation because the 
formula was originally developed from Stevens' own test data. ) 

Corliss and Winzer (ref. 18) presented loudness judgment data obtained using mag- 
netic tape recordings of tapping machines and women's footsteps on various floor mate- 
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rials. These sounds were equated in loudness to a 1 -kilohertz pure tone by nine ob- 
servers individually in a reverberant laboratory room. The subjective loudness levels 
in phons, as well as the loudness levels computed assuming loudness summation (Ste- 
vens, ref. 6, Zwicker, ref. 20) and power summation (this report) are shown in table I. 
(Subjective loudness levels in decibels are often called phons. ) In addition, the corre- 
sponding overall sound- pressure levels are shown, since Corliss and Winzer found that 
overall sound-pressure level was better correlated with subjective loudness judgments 
than with Stevens', and especially Zwicker 's, prediction methods. It is evident from 
table I that power summation yields better agreement with the subjective matches than 
does overall sound -pres sure level or loudness summation. The average absolute dis- 
agreement with subjective levels is 2. 5 decibels for power summation, 3. 5 decibels for 
overall sound- pressure level, 5. 5 decibels for Stevens' loudness summation method, 
and 11 decibels for Zwicker 's loudness summation method. The predictions by Stevens' 
method are not good, whereas those by Zwicker's method are certainly poor. In addi- 
tion, when the data for the different sounds are assembled in order of decreasing loud- 
ness, it becomes apparent that power summation predicts better ordering, namely, 1, 

2, 4, 5, 3, 6, than the other methods. 

Pearsons, Horonjeff, and Bishop reported equal loudness judgments by twenty sub- 
jects individually estimating the louder of two sounds (paired comparison method) in an 
anechoic chamber. Some data, listed in table n, were for comparisons between pure 
tones and octave bands of noise centered at the tone frequency. Loudness levels com- 
puted according to the method of power summation are also tabulated. For pure tones 
the loudness level equals the sound-pressure level plus the loudness-transmittance level 
of the entire auditory system. For a progressive wave consisting of a 1-kilohertz pure 
tone the transmittance is set equal to unity, so that the loudness-transmittance level 
equals zero. Then the loudness level equals the sound-pressure level. This equality 
at 1 kilohertz is observed in table n. Otherwise, the computed loudness levels for the 
pure tones are greater than the sound-pressure levels, as they should be for progres- 
sive waves at the frequencies considered since the external auditory system is a sound 
amplifier at these frequencies (ref. 16). At the higher frequencies the external filtering 
induced by the head causes considerable amplification (12 db at 4 kHz) of the signal 
reaching the eardrum (ref. 16). Thus, the calculated loudness levels are quite accurate. 
The differences between the computed loudness levels of the sound pairs effectively 
judged equally loud are listed in the last column of table n, where perfect agreement be- 
tween subjective judgments and computed values would correspond to a vanishing differ- 
ence. The average absolute difference is 2. 4 decibels. 

Pearsons, Horonjeff, and Bishop also reported on equal loudness matches between 
the octave bands of noise and octave bands of noise containing pure tones, between sim- 
ulated jet noise with and without a superposed pure tone, and between another broad- 
band noise with and without a superposed pure tone. With the possible exception of the 
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first case, only the difference between the computed loudness levels of the components 
of each pair are of interest because the absolute loudness levels cannot be established. 
For the entire set of tests the average absolute difference of the computed loudness 
levels of noises judged equally loud was 3 decibels when using the method of power 
summation. 

Another set of tests for which absolute loudness levels cannot be readily determined 
is that of Robinson and Bows her (ref. 21). Their judgment data for only one loudness 
level were obtained by having 570 observers, 10 at a time, make paired comparisons of 
tape-recorded aircraft noises in a "moderately" reverberant room. The conditions for 
equal loudness, as well as the corresponding predictions, are shown in table III. Pre- 
dicted annoyance levels are also included and are discussed in the next section. The 
standard deviations of these data in decibels relative to the average overall sound pres- 
sure, loudness, or annoyance are listed in table IV, wherein smaller numbers corre- 
spond to smaller standard deviations. For these data Zwicker's procedure exhibits the 
greatest precision, but power summation is a close second. When these data are com- 
pared with those of Corliss and Winzer, it again appears that the accuracy (as opposed 
to precision) obtained by assuming power summation is better than that obtained by 
assuming loudness summation. 

To show in the case of the Robinson -Bowsher data that power summation is probably 
more accurate than loudness summation, it should be recalled that over the most sensi- 
tive range of the internal auditory system, say, from 200 to 1500 hertz at moderate to 
high sound-pressure levels, the loudness level of a pure tone approximately equals its 
sound- pressure level. Since the loudness of noise within a critical band equals the loud- 
ness of a pure tone possessing the same sound pressure at the band midfrequency, it 
also follows that the loudness level of a critical band of noise contained within the high 
auditory-sensitivity interval (200 to 1500 Hz) approximately equals its sound-pressure 
level. If the noise bandwidth exceeds the critical bandwidth, then the question of the 
appropriate summation, loudness or power, arises. The noise spectra used by Robin- 
son and Bowsher are similar to those considered by Corliss and Winzer in that the 
energy tends to be concentrated in the high auditory-sensitivity interval. In the latter 
case, as already shown, power summation was the most accurate process by far, and 
the loudness levels predicted thereby agreed fairly well with the overall sound-pressure 
level (average absolute difference, 2. 6 db). The values given by Stevens procedure were 
about 5 decibels too high and those given by Zwicker's procedure about 11 decibels too 
high. The fact that predictions according to Zwicker's procedure are high has also been 
noted elsewhere (ref. 22). All the same results and conclusions follow from the 
Robins on -Bows her data. Hence, as before, power summation appears to yield a more 
accurate estimate of loudness than loudness summation. 
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ANNOYANCE 


Finally, it is worthwhile to consider the method of power summation to predict 
annoyance, since annoyance and loudness are closely related. If the attention of an 
individual is consciously directed toward perceiving sounds, then the individual is 
capable of making quantitative judgments of the relative annoyance of different sounds, 
just as he is able to make relative loudness judgments. On this basis Kryter (ref. 9) 
devised an empirical annoyance prediction procedure paralleling Stevens' loudness pre- 
diction procedure. In order to differentiate his annoyance levels from Stevens' loudness 
levels, Kryter designated annoyance levels in perceived noise decibels (PNdb). The 
annoyance level in PNdb is equivalent to annoyance expressed in decibels. It appears 
that the accuracy of the annoyance prediction scheme has never been tested. The con- 
census of many tests is that annoyance levels judged over short time periods are equal 
to Stevens' loudness levels (refs. 23 and 24) unless the judged sound contains relatively 
intense pure tone components (ref. 19), that is, contains information as opposed to 
noise. Then, the annoyance level exceeds the loudness level. Because of its close 
relation to Stevens' loudness prediction method, Kryter's annoyance prediction method 
tends to yield similar numerical results in the absence of pure tones. An example is 
illustrated in table in. If Stevens' loudness predictions are doubtful, then Kryter's 
annoyance predictions must also be doubtful. The doubtfulness of prior annoyance pre- 
dictions was apparently first suspected on theoretical grounds by Jones (ref. 24), who 
proposed power summation for computing annoyance based on a more elementary theory 
than in reference 13. More recently, Kryter (ref. 23) made the same proposal, but, in 
addition, noted that the sums must be over critical bands, whereas Jones rejected the 
significance of critical bands. The precision of the proposals was tested, but no 
attempts were made to test their accuracy. As it relates to these proposals, the loud- 
ness prediction procedure reported herein was independently derived from the theory in 
reference 13. The importance of the auditory integration time and the important effects 
of the noise source geometry and method of observation are included in the calculation 
procedure for the first time. Finally, the auditory transmittance function is based on 
tests using pure tones, rather than bands of noise. Pure tones can be accurately speci- 
fied and reproduced. In general, this is not possible for noise. 

Lewis Research Center, 

National Aeronautics and Space Administration, 

Cleveland, Ohio, March 25, 1971, 

129-01. 
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TABLE I. - COMPARISON OF MEASURED OVERALL SOUND- PRESSURE 
LEVEL a AND COMPUTED LOUDNESS LEVELS WITH SUBJECTIVE 
LOUDNESS- LEVEL JUDGMENTS FOR VARIOUS NOISES 


[Data from Corliss and Winzer, ref. 18.J 


Noise source 

Subjective 

loudness 

level, 

phons 

Overall 

sound- 

pressure 

level, 

db 

Loudness level, phons 

Stevens 
(ref. 6) 

Zwicker 
(ref. 20) 

Howes 
(this report) 

Tapping machine on - 






1. Concrete 

81 

81. 5 

86 

91 

83.6 

2. Vinyl tile over concrete 

80 

81 

85.5 

92 

82.6 

3. Oak blocks over concrete 

74.5 

78.5 

80.4 

86 

77.7 

Footsteps on - 






4. Concrete 

77. 8 

83 

86 

88 

81.5 

5. Vinyl tile over concrete 

75.8 

81 

82. 5 

86 

78.0 

6. Oak blocks over concrete 

74.5 

80.5 

77 

86 

74.0 


Referenced to 2x10 5 N-m'^. 


TABLE II. - COMPARISON OF MEASURED OVERALL SOUND- 
PRESSURE LEVELS a AND COMPUTED LOUDNESS LEVELS 
FOR JUDGED LOUDNESS EQUALITY OF PURE TONES 
AND OCTAVE BANDS OF NOISE CENTERED 
AT THE TONE FREQUENCY 
[Data from Pearsons, Horonjeff, and Bishop, ref. 19 ^ 


Tone 

frequency, 

Hz 

Octave band 

Pure tone 

Loudness- 

level 

difference, 

phons 

Overall 

sound- 

pressure 

level, 

db 

Loudness 

level, 

phons 

Overall 

sound- 

pressure 

level, 

db 

Loudness 

level, 

phons 

250 


98. 4 

97.0 

97.9 

0.5 


94.0 

95.4 

88.5 

90. 4 

5.0 

1 ' 

95.5 

97.4 

94. 5 

94.5 

2.9 


96.5 

102. 4 

97.5 

101.3 

1.1 


95.5 

106.3 

98.0 

108. 8 

-2.5 


Referenced to 2x10"^ N-m’ 2 . 





TABLE m. - COMPARISON OF MEASUREMENTS AND COMPUTED LOUDNESS 


AND ANNOYANCE LEVELS FOR VARIOUS AIRCRAFT SOUNDS 
JUDGED EQUALLY LOUD 


(Data from Robinson and Bowsher, ref. 21.] 


Sound 

Overall 

sound- 

pressure 

level, 

db 

Sound 
level A, 
db 

Loudness level, phons 

Annoyance 

level, 

PNdb 


Stevens 

Z wicker 

Howes 

Bristol helicopter 
hovering 

93.7 

87.6 

100.8 

104.7 

92. 4 

101.0 

Westland helicopter 
taking off 

96.8 

91,2 

102.8 

105.7 

94.2 

103.4 

Fairey Rotodyne 
taking off 

89.0 

85.9 

98.0 

103.8 

90.3 

100.7 

Fairey Rotodyne 
flying past 

96.3 

89.4 

101.9 

106.0 

91.8 

102.2 

Boeing 707/120 
climbing 

91.0 

88.8 

98,2 

105. 1 

91.3 

100.1 


TABLE IV. - STANDARD DEVIATION 


LEVELS a OF DATA IN TABLE III 


Measurement or calculation 

Standard 

deviation 

level, 

db 

Overall sound-pressure level 

-1.8 

Sound level A 

-3.5 

Loudness level from Stevens 

-3.3 

Loudness level from Zwicker 

-7.2 

Loudness level from Howes 

-4.5 

Annoyance level 

-4.9 


a Referenced to average values of sound 
pressure, loudness, or annoyance. 
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